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Abstract — The  problem  of  hypothesis  testing  against  indepen¬ 
dence  for  a  Gauss-Markov  random  field  (GMRF)  is  analyzed. 
Assuming  an  acyclic  dependency  graph,  an  expression  for  the 
log-likelihood  ratio  of  detection  is  derived.  Assuming  random 
placement  of  nodes  over  a  large  region  according  to  the  Poisson 
or  uniform  distribution  and  nearest-neighbor  dependency  graph, 
the  error  exponent  of  the  Neyman-Pearson  detector  is  derived 
using  large-deviations  theory.  The  error  exponent  is  expressed  as 
a  dependency-graph  functional  and  the  limit  is  evaluated  through 
a  special  law  of  large  numbers  for  stabilizing  graph  functionals. 
The  exponent  is  analyzed  for  different  values  of  the  variance 
ratio  and  correlation.  It  is  found  that  a  more  correlated  GMRF 
has  a  higher  exponent  at  low  values  of  the  variance  ratio  whereas 
the  situation  is  reversed  at  high  values  of  the  variance  ratio. 


(a)  Wi:  Gauss-Markov  random  field  (b)  Ho  :  Independent  observations, 

with  nearest-neighbor  dependency. 

Fig.  1.  Illustration  of  the  hypothesis-testing  problem. 


Index  Terms — Detection  and  Estimation,  Error  exponent,  Gauss- 
Markov  random  fields,  Law  of  large  numbers. 

I.  Introduction 

Or  distributed  detection,  the  so-called  conditionally  IID 
assumption  is  mathematically  convenient  and  is  widely 
assumed  in  the  literature.  The  assumption  states  that  condi¬ 
tioned  on  a  particular  hypothesis,  the  observations  at  sen¬ 
sors  are  independent  and  identically  distributed.  In  practice, 
however,  spatially  distributed  sensors  often  observe  correlated 
data,  since  natural  spatial  signals  have  stochastic  dependence. 
Examples  of  correlated  signals  include  measurements  from 
temperature  and  humidity  sensors,  or  from  magnetometric 
sensors  tracking  a  moving  vehicle.  Audio  data  is  also  rich 
in  spatial  correlations,  due  to  the  presence  of  echoes. 

Spatial  random  signals  are  typically  acausal  in  contrast 
to  temporal  signals.  In  the  literature,  the  two  are  usually 
distinguished  by  referring  to  acausal  signals  as  random  fields 
(RF)  and  causal  signals  as  random  processes  (RP).  Random 
fields  are  of  interest  in  a  variety  of  engineering  areas  and 
may  represent  natural  phenomena  such  as  the  dispersion  of 
atmospheric  pollutants,  groundwater  flow,  rainfall  distribution 
or  the  mesoscale  circulation  of  ocean  fields  [2]. 

In  this  paper,  we  consider  the  problem  of  hypothesis  testing 
for  independence,  shown  in  Fig.l.  Specifically,  under  the 
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alternative  hypothesis,  sensors  collect  samples  from  a  Gauss- 
Markov  random  field  (GMRF),  whereas  the  samples  are  in¬ 
dependent  under  the  null  hypothesis.  We  model  the  GMRF 
through  a  graphical  approach,  in  which  a  dependency  graph 
(DG)  specifies  the  stochastic  dependence  between  different 
sensor  observations.  This  dependency  graph  can  have  different 
degrees  of  sparsity  and  can  even  be  fully  connected.  However 
typically,  spatial  interactions  are  based  on  proximity,  where 
the  edges  are  included  according  to  some  specified  rule  based 
on  the  local  point  configuration  [3],  [4].  With  a  regular  lattice 
structure  (e.g.,  in  image  processing,  Ising  model),  a  fixed  set 
of  neighbors  can  be  specified  in  a  straight-forward  manner 
[5].  However,  the  situation  is  more  complicated  for  arbitrarily 
placed  nodes.  In  this  paper,  we  consider  the  nearest-neighbor 
graph  (NNG),  which  is  the  simplest  proximity  graph.  The 
nearest-neighbor  relation  has  been  used  in  several  areas  of 
applied  science,  including  the  social  sciences,  geography  and 
ecology,  where  proximity  data  is  often  important  [6],  [7]. 

We  consider  the  Neyman-Pearson  (NP)  formulation,  where 
the  detector  is  optimal  at  a  fixed  false-alarm  probability.  But, 
under  this  formulation,  analysis  of  performance  metrics  such 
as  error  probability  is  intractable  for  an  arbitrary  number  of 
observations.  Hence,  we  focus  on  the  large-network  scenario, 
where  the  number  of  observations  goes  to  infinity.  For  any  pos¬ 
itive  fixed  level  of  false  alarm  or  the  type-I  error  probability, 
when  the  mis-detection  or  the  type-II  error  probability  Pm  (n) 
of  the  NP  detector  decays  exponentially  with  the  sample  size 
n,  we  have  the  error  exponent  defined  by 

D:=-  lim  -logPM(n).  (1) 

n — kx>  ft 

The  error  exponent  is  an  important  performance  measure  since 
a  large  exponent  implies  faster  decay  of  error  probability  with 
increasing  sample  size. 
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Additionally,  we  assume  that  the  sensors  observing  the  sig¬ 
nal  field  are  placed  i.i.d.  according  to  the  uniform  or  Poisson 
distribution.  Since  nodes  are  placed  irregularly,  it  results  in  a 
non-stationary  GMRF  (for  the  definition  of  stationary  GMRF, 
see  [8,  P.  57]).  We  assume  that  the  number  of  nodes  goes 
to  infinity,  by  way  of  the  coverage  area  of  the  nodes  going 
to  infinity,  while  keeping  the  node  density  fixed.  Under  this 
formulation,  we  derive  the  detection  error  exponent,  assuming 
access  to  all  the  observations. 

A.  Related  Work  and  Contributions 

The  kind  of  hypothesis  testing  we  consider  is  called  testing 
for  independence.  In  [9],  [10],  problems  of  this  kind  are 
considered  with  rate  constraints  on  the  channels  and  for  two 
sources,  using  a  large  number  of  samples  at  each  source.  In  this 
paper,  we  assume  that  there  are  no  constraints  on  the  channel 
and  that  the  observations  have  the  correlation  structure  of  the 
GMRF.  Our  formulation  is  different  since  there  is  a  single 
observation  at  every  sensor,  and  the  number  of  sensors  goes 
to  infinity. 

GMRF  is  also  known  as  conditional  auto-regression  (CAR) 
in  the  seminal  work  of  Besag  [11],  [12].  They  have  a  wide  ar¬ 
ray  of  applications  in  fields  such  as  speech  recognition,  natural 
language  processing,  coding,  geo-statistics,  image  analysis  and 
AI.  The  literature  is  too  vast  to  mention  here.  For  an  exposition 
on  GMRF,  see  [8],  [13]. 

Another  related  problem  is  the  detection  of  Gauss-Markov 
random  processes  (GMRP)  in  Gaussian  noise,  which  is  a 
classical  problem  [14].  There  is  an  extensive  literature  on 
the  large-deviations  approach  to  the  analysis  of  detection 
of  GMRP  [  15]— [25],  but  closed-form  expressions  have  been 
derived  only  for  some  special  cases,  e.g.,  [26]-[28].  GMRP 
has  been  characterized  via  inversion  algorithms  for  block- 
banded  matrices  [29],  [30].  However,  these  approaches  are  not 
amenable  to  the  extension  of  the  problem  to  planar  and  higher 
dimensional  spaces,  since  they  deal  with  random  processes 
rather  than  random  fields,  or  to  the  random  placement  of 
nodes. 

Related  to  the  GMRF,  there  is  an  alternative  and  more 
restrictive  approach,  known  as  the  spatial  auto-regressive 
model  (SAR)  and  has  been  extensively  studied  in  the  field 
of  spatial  data-mining.  In  [7],  this  formulation  is  considered 
with  (directed)  nearest-neighbor  interaction  and  a  closed-form 
ML  estimator  of  the  AR  spatial  parameter  is  characterized.  We 
do  not  consider  this  formulation  in  this  paper. 

To  our  knowledge,  large-deviation  analysis  of  the  detection 
of  acausal  non-stationary  GMRF  has  not  been  treated  before. 
We  first  express  the  likelihood  function  of  a  GMRF  with  an 
arbitrary  acyclic  dependency  graph,  in  terms  of  its  covariance 
matrix.  The  joint  distribution  can  also  be  derived  by  expressing 
it  in  terms  of  the  marginal  probability  of  the  nodes  and  the 
joint  probability  at  the  edges  of  the  dependency  graph  [31], 
[32], 

We  consider  the  detection  problem  represented  in  Fig.l, 
under  the  additional  assumptions  of  nearest-neighbor  depen¬ 
dency.  We  consider  the  location  of  the  sensors  as  a  random 
point  set  drawn  from  uniform  or  Poisson  distribution  and 


defined  on  expanding  regions.  This  framework  allows  us  to 
exploit  recent  advances  in  computational  geometry  [33],  [34]. 
By  casting  the  error  exponent  as  a  limit  of  the  sum  of  graph 
functionals,  we  are  able  to  apply  the  special  law  of  large 
numbers  (LLN)  for  functionals  on  graphs  derived  in  [33]. 
We  obtain  the  final  form  of  the  exponent  by  exploiting  some 
special  properties  of  the  NNG.  We  then  numerically  evaluate 
the  exponent  for  different  values  of  the  variance  ratio  and 
correlation,  for  exponential  and  constant  correlation  functions. 
We  conclude  that  at  a  fixed  node  density,  a  more  correlated 
GMRF  has  a  higher  exponent  at  low  values  of  variance  ratio, 
whereas  the  opposite  is  true  at  high  values  of  variance  ratio. 

B.  Notation  and  Organization 

Vectors  and  matrices  are  written  in  boldface.  Random 
variables  are  in  capital  letters,  random  processes  and  random 
fields  in  boldface  capitals  and  sets  in  calligraphic  font.  For 
the  matrix  A  =  [A(i,j)],  A(i,j)  denotes  the  element  in  the 
i,b  row  and  j'h  column  and  |A|  its  determinant.  For  sets  A  and 
B,  let  A\B  =  {i  :  i  G  A,i  ^  B}  and  let  |  •  |  denote  cardinality. 

An  undirected  graph  Q  is  a  tuple  Q  =  (V,£)  where  V  = 
{l,2,...,n}  is  the  vertex1  set  and  £  =  i,j  €  V,i  ^ 

j}  is  the  edge  set.  When  i  and  j  have  an  edge  between  them, 
i  and  j  are  neighbors  denoted  by  i  ~  j  (otherwise  it  is  i  ^  j ). 
For  a  directed  graph,  we  denote  the  edges  by  £  =  {<  i,j  > 
,  t,j  S  V,i  ^  j},  where  the  direction  of  the  edge  is  from  i 
to  j.  The  neighborhood  function  of  a  node  i  is  the  set  of  all 
other  nodes  having  an  edge  with  it,  i.e., 

Af(i)  =  {j  G  V  :  j  ±  i,  {i,j)  G  £}.  (2) 

The  number  of  neighbors  of  a  node  i  is  called  its  degree, 
denoted  by  Deg(i).  A  node  with  a  single  edge  i.e.,  its  degree 
is  1  is  known  as  a  leaf  and  the  corresponding  edge  as  a  leaf 
edge,  otherwise  it  is  known  as  an  internal  or  interior  edge.  Let 
dist(i,  j)  be  the  Euclidean  distance  between  any  two  nodes. 
Let  Rij  denote  the  (random)  Euclidean  edge-length  of  (i,j) 
in  graph  Q  =  (V,£), 

Rij  =  dist (i,j),  V  (i,  j)  G  £.  (3) 

Our  paper  is  organized  as  follows.  We  provide  a  description 
of  the  GMRF  in  section  II,  focusing  on  the  acyclic  dependency 
graph  in  section  III  and  providing  an  expression  for  the 
likelihood  function  in  section  III-A.  We  define  the  hypothesis¬ 
testing  problem  in  section  IV  and  specify  additional  assump¬ 
tion  on  the  covariance  matrix  of  the  GMRF  in  section  IV-A. 
In  section  IV-B,  we  assume  additionally  that  the  dependency 
graph  is  the  nearest-neighbor  graph.  We  provide  an  expression 
for  the  log-likelihood  ratio  in  section  IV-C.  We  define  the  error 
exponent  under  the  Neyman-Pearson  formulation  in  section  V 
and  specify  the  random  placement  of  nodes  in  section  V-A.  In 
section  VI  we  evaluate  the  error  exponent,  expressing  it  as  a 
graph  functional  in  section  VI- A,  applying  the  LLN  for  graphs 
in  section  VI-B,  and  providing  an  explicit  form  for  NNG  in 
section  VI-C.  We  provide  numerical  results  for  the  exponent 
in  section  VI-D,  and  section  VII  concludes  the  paper. 

*We  consider  the  terms  node,  vertex  and  sensor  interchangeable. 
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(a)  A  labeled  simple  undirected 
graph. 
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(b)  X  :  Non-zero  elements  of 
potential  matrix. 


Fig.  2.  Dependency  graph  and  potential  matrix  of  a  GMRF. 


II.  Gauss-Markov  Random  Field 

A  GMRF,  in  addition  to  being  a  Gaussian  random  field, 
satisfies  special  conditional  independence  properties.  A  simple 
example  is  the  first-order  auto-regressive  process,  where  the 
conditional  independence  of  the  observations  is  based  on 
causality.  However,  a  spatial  random  field  has  a  far  richer 
set  of  conditional  independencies,  requiring  a  more  general 
definition  [8,  R  21], 

Definition  1  (GMRF):  Given  a  point  set  V  =  {1 
Y y  =  {  V j  :  i  £  V}isa  GMRF  with  an  (undirected) 
dependency  graph  Q(V,£)  if  Yy  is  a  Gaussian  random  field, 
and  Vi,j  £  V,  Y,  and  Y:l  are  conditionally  independent  given 
observations  at  all  other  nodes  if  i  and  j  are  not  neighbors, 
i.e.. 


Yi  ±  Yj  |  Y_y  <«=►  i*  j,  Vi,  j  £  V,  j ,  (4) 

where  _L  denotes  conditional  independence  and  Y _lj-.=(Yk  : 
k£V,k^  i,j). 

A  common  approach  to  formulating  a  GMRF  is  to  specify 
the  dependency  graph  through  a  neighborhood  rule  and  then 
to  specify  the  correlation  function  between  these  neighbors. 
Thus,  in  a  GMRF,  local  characteristics  completely  determine 
the  joint  distribution  of  the  Gaussian  field. 

The  following  Markov  properties  are  equivalent  in  a  GMRF: 

1)  pairwise-Markov  property 


of  a  dependency  graph  are  independent.  As  an  illustration,  in 
Fig. 2  we  have  Y§  _L  Yj  given  the  rest  of  network,  Y\  _L  Y%\Y 3, 
and  so  on. 


III.  Acyclic  Dependency  Graph 

A  special  case  of  the  dependency  graph  is  an  acyclic 
or  a  cycle-free  graph.  Here,  the  neighbors  of  a  node  are 
not  themselves  neighbors.  The  joint  distribution  is  somewhat 
easier  to  evaluate  in  this  case.  We  note  that  an  acyclic  graph 
with  at  least  one  edge,  always  has  a  leaf  i.e.,  it  has  a  node 
with  degree  1  and  has  utmost  n—  1  edges  in  a  n-node  graph. 

The  covariance  matrix  £  of  a  GMRF  satisfies  some  special 
properties.  For  instance,  consider  the  cross  covariance  between 
the  neighbors  of  a  node,  i.e.,  nodes  that  are  two  hops  away 
in  an  acyclic  DG.  By  the  global-Markov  property  we  have2, 
assuming  £(i,  i)  >  0,  for  i  £  V,  Deg(i)  >  2,  j,  k  £  A f(i),j  7^ 
k, 


%(j,  k ) 


£  (i,j)T,(i,k) 

£(M) 


For  example,  in  Fig. 2, 


=  £(1,3)£(2,3) 
£(3,3) 


(8) 


(9) 


We  can  similarly  find  an  expression  for  the  covariance  between 
any  two  nodes  of  the  GMRF.  Thus,  the  covariance  matrix  of 
a  GMRF  with  acyclic  dependency  can  be  expressed  solely 
in  terms  of  the  auto  covariance  of  the  nodes  and  the  cross 
covariance  between  the  neighbors  of  the  dependency  graph. 


A.  Potential  Matrix 

The  inverse  of  the  covariance  matrix  of  a  non-degenerate 
GMRF  (i.e.,  with  a  positive-definite  covariance  matrix)  is 
known  as  the  potential  matrix  or  the  precision  matrix  or  the 
information  matrix.  The  non-zero  elements  of  the  potential 
matrix  A  are  in  one  to  one  correspondence  with  the  edges  of 
its  graph  Q(V7£)  [8,  Theorem  2.2]  in  the  sense  that 


Yi±Yj  |Y_y  <*=»  (5) 

2)  local-Markov  property 

Yi  -L  | Y^^j) ,  (6) 

3)  global-Markov  property 

Y.4  JL  Ye|Yc,  (7) 

for  disjoint  sets  A,  B,  and  C,  with  A  and  B  non-empty, 
where  the  set  C  separates  A  and  B  i.e.,  on  removing  the 
nodes  in  C  from  the  graph,  nodes  in  A  are  no  longer 
connected  to  the  nodes  in  B. 

Thus,  in  (6),  the  local-Markov  property  states  that  the  condi¬ 
tional  distribution  at  a  node  in  the  DG  given  the  observations  at 
its  neighbors  is  independent  of  the  rest  of  the  network.  By  the 
global-Markov  property  in  (7),  all  the  connected  components 


i*j  A(i,j)  =  0,  Vi,  j  £  V,i  7^  j,  (10) 
and  is  illustrated  in  Fig. 2. 

This  simple  correspondence  between  the  conditional  inde¬ 
pendence  of  the  GMRF  and  the  zero  structure  of  its  potential 
matrix  is  not  evident  in  the  covariance  matrix,  which  is 
generally  a  completely  dense  matrix.  Therefore,  it  is  easier 
to  evaluate  the  joint  distribution  of  the  GMRF  through  the  po¬ 
tential  matrix.  In  practice,  however,  estimates  of  the  covariance 
matrix  are  easier  to  obtain  through  the  empirical  observations. 
Therefore,  it  is  desirable  to  have  the  joint  distribution  in  terms 
of  coefficients  of  the  covariance  matrix.  Thus,  an  explicit 
expression  between  the  coefficients  of  the  covariance  and  the 
potential  matrix  is  needed.  We  provide  such  an  expression 
and  also  obtain  the  determinant  of  the  potential  matrix  in  the 
theorem  below. 

2For  X,Y  jointly  zero  mean  Gaussian,  E(X|y)  =  E^Ejjjjy. 
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Theorem  1  (Elements  &  Determinant  of  Potential  Matrix): 
The  elements  of  the  potential  matrix  A:=£  ,  for  a  positive- 

definite  covariance  matrix  E  and  acyclic  dependency  graph 
G(V,£),  are 


A(i,i)  = 


£(m) 


('  +  E 


ieAT(i) 


s(M)£(j,j)  -  j)2' 

if  i  ~  j. 


A(i,j)  =  {  Z(i,i)Z(j,j)-Z(i,j)a  J’  (11) 

0  o.w. 


The  determinant  of  the  potential  matrix  of  A  is  given  by 


|A|  = 


1 

W\ 


nievs(i,oDegw-1 

n  j)  -  £(f,  j)2] ' 

i~j 

i<j 


(12) 


Proof:  The  proof  is  based  on  acyclicity  of  dependency 

graph.  See  Appendix  A.  □ 


IV.  Hypothesis-Testing  Problem 
Let  V  =  {1, . . . ,  n}  be  a  set  of  n  nodes  on  the  plane  and  let 
Y„  be  the  random  vector  of  observation  samples  Y,  .  i  <£  V, 

Y„:=[Yi, . . . ,  Yn]T .  (13) 

The  hypothesis-testing  problem  is  as  follows  (also  see  Fig.l), 


H0  ■  Yn  ~  Af(0,  o-qI)  vs.  Ht  :  Y„  ~  A"(0,  S1)V),  (14) 

where  S1;y  is  a  positive-definite  covariance  matrix  under  the 
alternative  hypothesis  and  is  dependent  on  the  configuration 
of  nodes  in  V  and  (Tq  >  0  is  the  uniform  variance  under  the 
null  hypothesis. 

The  optimal  decision-rule  under  both  NP  and  Bayesian 
formulations  is  a  threshold  test  based  on  the  log-likelihood 
ratio  (LLR).  Let  p[Y„  |V:  Hj\  be  the  conditional  PDF  of  the 
observations  given  the  point  set  V  under  hypothesis  j.  The 
LLR  given  by 


Tirwv  in  1  PlYn,V;Ho\  ,  p[Yn-,Ho\ 
LLR(Y„,  V)  :=  log  ,  =  log  ZT^7  ^  (15) 


’  p[Y„,  V;  H\] 


'p[Yn\V-,Hi 


\  (log 


->i,v  I 


Y«  Pr,v  —  (<JoI)_i]Y 


where  in  (15),  we  have  used  the  fact  that  the  sensor  observa¬ 
tions  are  independent  of  V  under  Hq. 


A.  Covariance  Matrix  of  GMRF 

We  make  additional  assumption  on  the  structure  of  the 
covariance  matrix  Eyy  of  the  GMRF  under  H\  viz.,  that 
the  nodes  have  the  same  measurement  variance  for  any  node 
configuration  V,  i.e., 


We  denote  the  ratio  between  the  variances  under  the  alternative 
and  the  null  hypothesis  at  each  node  by 


K:= 


oi 


(17) 


We  also  assume  that  under  Hi,  the  amount  of  correlation 
between  the  neighbors  i,  j  of  the  dependency  graph  is  specified 
by  an  arbitrary  function  g,  which  has  the  Euclidean  edge  length 
Rj  j  as  its  argument.  From  (16),  we  have 


g(iZii):=Sl’v(2,j)  <  1,  v  (i,j)  e  £.  (18) 

°T 

The  correlation  function  g  is  required  to  satisfy  some  regularity 
conditions,  which  will  be  stated  in  Lemma  2.  In  general,  g  is 
a  monotonically  non-increasing  function  of  the  edge  length, 
since  amount  of  correlation  usually  decays  as  nodes  become 
farther  apart.  Moreover,  g(0)  =  M  <  1,  or  the  so-called 
nugget  effect,  according  to  geo-statistics  literature  [35],  [36]. 
It  has  been  observed  in  mining  applications,  where  the  micro¬ 
scale  variation  is  assumed  to  be  caused  by  the  existence  of 
small  nuggets  of  the  enriched  ore.  Many  other  ecological 
phenomena  such  as  soil  bacteria  population  [37],  aquatic 
population  [38]  etc.  also  exhibit  this  behavior.  Note  that  the 
presence  of  nugget  effect  has  the  same  effect  on  correlation  as 
imposing  an  exclusion  region  on  how  near  two  nodes  can  be 
placed.  However,  for  such  an  exclusion  constraint  to  hold,  we 
need  more  complicated  node  placement  distributions  than  the 
uniform  or  Poisson  assumption.  Although  such  distributions 
can  be  handled  in  principle,  they  are  not  analytically  tractable. 
Some  examples  of  the  correlation  function  are 

g(R)  =  Me-“fl,  g(R)  =  a  >  0, 0  <  M  <  1. 

Note  that  these  conditions  together  with  an  acyclic  dependency 
graph  Q  assure  that  the  covariance  matrix  Ej  y  is  positive 
definite.  This  is  because  V  i,j  €  V, 

EliV(i,t)E1,vO'»j)-S?|V(*,i)  =  al[  1-  II  92{Rki)\  >  0, 

(/c,/)Gpath(2,j) 

where  path(f,_7)  is  the  unique  edge-path  between  i  and  j  in 
graph  Q  if  it  exists.  From  Theorem  1,  we  have  |E|  >  0. 

B.  Nearest-Neighbor  Graph 

We  assume  the  dependency  graph  to  be  the  nearest-neighbor 
graph.  The  nearest-neighbor  function  of  a  node  i  £  V,  is 
defined  as, 


nn(?'):=arg  min  dist(i,j),  (19) 

where  dist(-,  -)  is  the  Euclidean  distance.  The  inter-point 
distances  are  unique  with  probability  1,  for  uniform  and 
Poisson  point  sets  under  consideration  here.  Therefore,  nn(i) 
is  a  well-defined  function  almost  surely.  The  nearest-neighbor 
(undirected)  graph  G(V,£)  is  given  by 


(i,j)  e  £ 


Ei,v(l  i)-=a \  >0,  i  =  1, . . . ,  n. 


(16) 


i  =  nn (j)  or  j  =  nn(i). 


(20) 
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_  Biroots  of 

of  directed  NNG 

Directed  NNG 
—  Undirected  NNG 


Fig.  3.  Directed  &  undirected  versions  of  nearest-neighbor  graph. 
[  No.  of  undirected  edges  =  No.  of  directed  edges  -  0.5  x  No.  of 
biroots.] 


Fig.  4.  Illustration  of  point  process  Un .  a  or  'Pn , a  :  n  nodes  distributed  i.i.d. 
uniform  or  Poisson  in  regular  Borel  regions  (such  as  squares  or  circles)  of 
area  with  constant  density  A.  For  error  exponent,  we  consider  n  — -  oo. 


NNG  has  a  number  of  important  properties.  It  is  acyclic  with 
a  maximum3  node  degree  of  6  [40]. 

In  section  VI-C,  it  turns  out  that  we  need  to  analyze  the 
directed  NNG,  in  order  to  obtain  the  final  form  of  the  error 
exponent.  We  now  mention  some  of  its  special  properties.  The 
directed  NNG  G'(V,£')  is  defined  by 

£’  =  {<  *,nn(i)  >,i€  V},  (21) 

For  a  directed  NNG  with  at  least  two  nodes,  each  connected 
component  contains  exactly  one  2-cycle.  This  is  known  as 
the  biroot  of  the  component  [40].  See  Fig. 3.  Also  note,  the 
directed  NNG  counts  the  edges  from  these  biroots  twice,  while 
the  undirected  version  counts  only  once. 


C.  Expression  for  Log-Likelihoocl  Ratio 

Since  the  NNG  is  acyclic,  equations  (11-12)  are  valid.  We 
incorporate  additional  assumptions  (16-18)  in  the  theorem  to 
obtain  the  LLR  for  detection. 

Theorem  2  (Log-Likelihood  Ratio):  Under  the  assumptions 
(16-18),  the  log-likelihood  ratio  in  (15)  for  the  hypothesis¬ 
testing  problem  in  (14),  given  an  arbitrary  point  set  V  = 
{l,...,n},  is 


LLR(Y„,  V) 


i  ai 

n  log  — 


ct  o 


Y- 


+ 

(bj)e£ 

i<j 


g2(RtJ)  Y?  +  Y? 
1  -  92{Rii)  cr? 

2 g(Rij)  y.Y,) 
i -g2{Ri3)  Jj’ 


(22) 


where  Rij  is  the  Euclidean  edge  length  of  (i,j)  £  £,  that 
depends  on  the  configuration  ofV.  The  condition  i  <  j  ensures 
that  every  edge  is  counted  only  once. 

Theorem  2  gives  a  closed-form  expression  for  the  log- 
likelihood  ratio,  in  terms  of  the  edges  of  the  nearest-neighbor 
dependency  graph  of  the  GMRF.  Note  in  (22),  the  cross-terms 
are  only  between  the  neighbors  of  the  dependency  graph, 
which  can  be  exploited  to  yield  explicit  data-fusion  and  routing 
schemes  [41]. 


■"'The  node  degree  is  finite  for  NNG  in  any  dimension  and  is  called  the 
kissing  number  [39], 


V.  Neyman-Pearson  Error  Exponent 


The  spectrum  of  the  log-likelihood  ratio  is  defined  as  the 
distribution  of  the  normalized  LLR  evaluated  under  the  null 
hypothesis.  In  [25,  Theorem  1],  it  is  proven  that  for  Neyman- 
Pearson  detection  under  a  fixed  type-I  error  bound  4,  the  LLR 
spectrum  can  fully  characterize  the  type-II  error  exponent  of 
the  hypothesis-testing  system  and  is  independent  of  the  type-I 
bound. 

A  special  case  of  this  result  is  when  the  LLR  spectrum 
converges  almost  surely  (a.s)  to  a  constant  D 


-LLR(Yn,  V)  =  -  log 
n  n 


p[Yn-,n0] 
p[  Y„|V;Wr] 


under  Ha¬ 


iti  this  case,  the  NP  type-II  error  exponent  is  given  by  the 
above  constant  D.  In  other  words,  the  error  exponent  D  of 
NP  detection  in  (1)  is 


D:=  lira  -log  t,  undergo,  (23) 

n^oon  p[Yn|  V;  7tiJ 

where  lim  denotes  the  almost-sure  limit,  assuming  it  exists. 
Note  that  when  Y„  are  i.i.d.  conditioned  under  either  Ha 
or  Hu  the  result  reduces  to  the  Stein’s  lemma  [43,  Theorem 
12.8.1]  and  the  limit  in  (23)  to  the  Kullback-Leibler  distance. 


A.  Random  Point  Sets 

It  is  intractable  to  evaluate  the  error  exponent  D  in  (23)  for 
an  arbitrary  point  set.  Therefore,  we  assume  that  the  nodes 
are  placed  randomly,  according  to  a  point  process  defined  on 
expanding  regions.  We  consider  two  related  point  processes  : 
the  Poisson  process  and  the  binomial  point  process  on  a  large 
region,  which  we  define  below. 

Definition  2  (Poisson  and  Binomial  Processes  [44]): 

Let  (Bn)n>i  denote  a  sequence  of  squares  or  circles5  of 
area  " ,  centered  at  the  origin,  for  any  A  >  0.  A  binomial 
point  process  on  Bn,  denoted  by  consists  of  n  points 

distributed  i.i.d.  uniformly  on  Bn.  A  homogeneous  Poisson 
process  of  intensity  A  on  Bn,  denoted  by  Vn.\,  satisfies  the 
following  properties: 

1)  for  any  set  A  C  Bn  with  area  A,  the  number  of  points 
in  A  is  Poisson  distributed  with  mean  A  A, 

4The  generalization  to  an  exponential  type-I  error  bound  [25],  [42]  is  not 
tractable  since  a  closed-form  cumulative  distribution  of  the  LLR  is  needed. 

5The  results  hold  for  regular  Borel  sets  under  some  conditions  [4,  P.  1007], 
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Normalized  sum  of  edge  weights 

J2  *(Re)  — ' 

eeg?(V) 


Origin 

•  • 


• 

Expectation  of  edges 
of  origin  of  Poisson  process 
iE  0(flo,x) 

X:X6Pa 

(X,0)€5(XU0) 


Fig.  5.  Pictorial  representation  of  LLN  for  graph  functionals  of  uniform  or 
Poisson  point  sets. 


2)  for  any  n  €  N  and  A  C  B,,  with  area  A  >  0,  conditioned 
on  n  number  of  points  in  A,  the  point  process  on  A  is 
a  binomial  process. 

We  are  interested  in  evaluating  the  error  exponent  under 
both  the  binomial  or  Poisson  point  processes,  when  the  mean 
number  of  nodes  goes  to  infinity,  with  fixed  node  density,  i.e., 
n  — >  oo  with  A  fixed. 


VI.  Closed-Form  Error  Exponent 
A.  Error  Exponent  as  a  Graph  Functional 

In  order  to  derive  the  error  exponent,  we  cast  the  error 
exponent  as  the  limit  of  sum  of  node  and  edge  functionals 
of  the  dependency  graph  of  a  marked  point  set  in  the  lemma 
below.  This  formulation  is  required  in  order  to  apply  the  law 
of  large  numbers  for  graph  functionals. 

Lemma  1  (D  as  a  Graph  Functional):  Given  the  marked 
point  set  V  drawn  from  the  binomial  process  U,,.\  or  the 
Poisson  process  Vn,x,  with  marking  variable  Yt  '"'A1  _A/"(0,  Og), 
the  error  exponent  D  in  (23)  is  given  by  the  limit  of  sum  of 
edge  and  node  functionals  of  the  nearest-neighbor  graph  as 


D 


i  ai 

=  log  — 

Co 


lim  — 

n—t oo  2 n 


i  rw  i 


iev  1  0 


+  J2  l10^1 

(®,j) 

i<j 


g2(Rij )]  + 


g\RlJ)  Y?  +  Y? 
i  -g2(Rij) 


2  g(Rij)  YjYj^ 

l-g2(Rij)  a\  f 


Y 


(24) 


where  Rij  is  the  (random)  Euclidean  edge  length  of  (i,j)  €  £, 
that  depends  on  the  underlying  point  process.  The  condition 
i  <  j  ensures  that  every  edge  is  counted  only  once. 

Proof  :  Substitute  (22)  in  (23).  □ 

In  the  lemma  above,  the  point  set  forming  the  graph  is  drawn 
from  a  marked  binomial  or  Poisson  point  process,  with  the 
marking  variable  Y,  7V(0,<7g).  This  is  because  evaluating 
the  error  exponent  (23)  under  H0  implies  that  the  sensor 
observations  Yj:  are  i.i.d.  and  independent  of  the  locations  of 
the  nodes,  and  therefore  can  be  viewed  as  a  marking  process. 


B.  Law  of  Large  Numbers  for  Graph  Functionals 

The  law  of  large  numbers  for  functionals  on  graphs  enables 
us  to  evaluate  the  limit6  in  (24).  This  law  applies  to  graphs 
which  are  random  in  the  sense  that  the  vertex  set  is  a  marked 
random  point  set.  LLN  on  graphs  is  based  on  the  so-called 
objective  method.  Steele  [45]  coined  this  term  for  a  philosophy 
whereby,  loosely  speaking,  one  describes  the  limiting  behavior 
of  functionals  on  finite  point  sets  of  binomial  process  in  terms 
of  related  functionals  defined  on  infinite  Poisson  point  sets. 
Also  see  Fig. 5.  Penrose  and  Yukich  [4],  [33],  [34]  introduce  a 
concept  of  stabilizing  functionals  and  use  the  objective  method 
to  establish  a  strong  law  of  large  numbers  for  graph  functionals 
[33,  P.  287].  In  order  to  apply  this  law,  some  conditions  need 
to  be  satisfied  in  terms  of  bounded  moments.  In  the  lemma 
below,  we  place  these  conditions  on  the  correlation  function. 

Lemma  2  ( Conditions  for  LLN):  The  graph  functional  in 
(24)  satisfies  the  conditions  for  law  of  large  numbers  for 
graph  functionals  derived  in  [33,  P.  287],  when  the  correlation 
function  g  is  monotonically  non-increasing  with  the  edge- 
lengths,  g( oo)  =  0,  and  g( 0)  =  M  <  1.  Hence,  the  graph 
functional  in  (24)  converges  almost  surely  to  a  constant. 
Proof  :  See  appendix  B. 

Theorem  3  (LLN):  Under  the  conditions  stated  in  Lemma  2, 
for  nodes  placed  according  to  Un\  or  Vn,\,  with  node  density 
A  and  region  area  j ,  from  the  law  of  large  numbers  for  graph 
functionals,  the  expression  for  the  error  exponent  D  in  (24) 
for  Neyman-Pearson  detection  of  the  GMRF  defined  by  the 
NNG  is  given  by 


2  L2 


F.  f(g{R  o.x)) 


XiXe-Px, 
(0,X)GNNG(XU0) 


■  log  K  +  —  -  1 


(25) 


where 


2r2 

/(z):=log[l^2]  +  K[1_x2],  (26) 

I\  is  the  ratio  of  variances  defined  in  (17),  and  i?o.x  are 
the  (random)  lengths  of  edge  (X,  0)  incident  on  the  origin 
in  a  NNG,  when  the  nodes  are  distributed  according  to 
homogeneous  Poisson  process  V\,  of  intensity  A. 

Proof  :  Apply  LLN  to  (24).  See  appendix  C.  □ 

In  the  theorem  above,  the  law  of  large  numbers  yields 
the  same  limit7  under  the  Poisson  or  the  binomial  process. 
Thus,  we  provide  a  single  expression  for  the  error  exponent 
under  both  the  processes.  Also,  the  above  theorem  provides 
the  error  exponent  in  terms  of  the  expectation  of  a  graph 
functional  around  the  origin,  with  the  points  drawn  from  an 
infinite  Poisson  process.  Thus,  the  functional  is  reduced  to 
a  localized  effect  around  the  origin.  This  is  an  instance  of 
the  broad  concept  of  stabilization  which  states  that  the  local 
behavior  of  the  graph  in  a  bounded  region  is  unaffected  by 

7  Nature  of  convergence  is  convergence  of  means  and  complete  convergence 
(c.m.c.c)  and  implies  almost-sure  convergence. 

7In  general,  the  limit  is  not  the  same  for  Poisson  and  binomial  processes. 
For  a  different  problem,  we  show  that  the  error  exponents  are  affected  by  a 
random  sample  size  [46]. 
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points  beyond  a  finite  (but  random)  distance  from  that  region,  and  region  area  ?,  the  error  exponent  D  for  Neyman-Pearson 
NNG  is  one  such  stabilizing  graph  with  translation  and  scale-  detection  is 
invariance  [4,  Lemma  6.1]. 


C.  Explicit  Form  for  Nearest-Neighbor  Graph 

The  evaluation  of  the  expectation  of  the  edge  functional  in 
(25)  is  complicated  and  needs  further  simplification.  In  order 
to  obtain  the  final  form  of  the  exponent,  we  exploit  some 
special  properties  of  the  NNG.  It  turns  out  that  the  expectation 
term  is  easier  to  evaluate  for  the  directed  nearest-neighbor 
graph  rather  than  the  undirected  version.  We  therefore  split  the 
sum  of  edge  functionals  in  (24),  using  the  fact  that  the  directed 
NNG  counts  the  weights  from  biroots  or  mutual  neighbors 
twice,  while  the  undirected  version  counts  only  once.  See 
Fig. 3.  We  therefore  split  the  sum  of  the  edge  functionals  of 
the  undirected  NNG  as 


E  f(9(Re))  =  E  fMRe)) 

eSNNG(V)  e6DNNG(V) 

-\  E  f(9(Re)),  (27) 

eSMNNG(V) 

where  NNG(V),  DNNG  and  MNNG  C  DNNG  are  the  undi¬ 
rected  NNG,  the  directed  NNG,  and  edges  between  the  biroots 
or  the  mutual  neighbors  of  the  directed  NNG,  respectively. 
Now,  we  evaluate  the  expectation  for  the  two  terms  separately, 
since  expectation  is  linear.  A  similar  approach  is  employed  in 
[47], 

We  now  provide  an  expression  for  the  limit  of  the  edge 
functional  based  on  the  distribution  of  distances  of  the  directed 
NNG,  which  are  related  to  hitting  or  vacancy  probabilities  of 
the  spatial  point  process,  which  are  typically  exponential  or 
gamma  distributed,  similar  to  their  one-dimensional  counter¬ 
parts  [48]. 

Lemma  3  (Expectation  of  Edge  Functional):  The  expecta¬ 
tion  term  of  the  edge  functional  in  (25)  is  given  by 


X:X6Pi, 

(O.X)eNNG(XUO) 


^Ef(g(Z2)),  (28) 


where  Z\  and  Z2  are  Rayleigh  distributed  with  variances 
(27tA)_1  and  (2wA)_1,  and  to  is  given  by 


LO  = 


47T 

3 


73 

2 


5.06, 


(29) 


and  is  the  area  of  the  union  of  two  unit-  radii  circles  with 
centers  unit  distant  apart. 

Proof  :  See  appendix  D.  □ 

In  the  theorem  below,  we  combine  Lemmas  2,  3,  and 
Theorem  3  to  obtain  the  final  form  of  the  error  exponent. 

Theorem  4  (Expression  for  D):  Under  the  assumptions 
(16-18)  and  conditions  stated  in  Lemma  2,  for  a  GMRF  with 
NNG  dependency  and  correlation  function  g  and  nodes  drawn 
from  the  binomial  or  the  Poisson  process  with  node  density  A 


Dg(K,  M,  A)  =  *  [EfigiZJ,  K)  -  |-E f(g(Z2),K) 

+  logiT+-^-l],  (30) 

where 


f(x,  A'):=log[l 


2i  2a;2 
X  ]  +  K[1  -  a;2] ' 


(31) 


Z\  and  Zi  are  Rayleigh  distributed  with  second  moments 
(27tA)_1  and  (2wA)-1. 

The  above  theorem  holds  for  any  general  correlation  func¬ 
tion.  In  (30),  except  for  the  first  two  /-terms  which  capture 
the  correlation  structure  of  the  GMRF,  the  remaining  terms 
represent  the  detection  error  exponent  for  two  IID  Gaussian 
processes.  In  the  corollary  below,  we  specialize  (30)  to  the  case 
of  constant  correlation.  In  this  case,  the  two  /-terms  reduce 
to  a  single  term. 

Corollary  1  ( Constant  Correlation ):  For  constant  values  of 
the  correlation,  the  error  exponent  D  is  independent  of  the 
node  density  A  and 

1)  for  constant  positive  correlation  or  g{Re)  =  M  < 
1,  Ve  £  £,  we  have 


D{K,  M) 


2il0g 

+  (1- 


k+k~1 


(32) 


where  /  and  u  are  given  by  (26)  and  (29). 

2)  for  the  independent  case  or  g(Re)  =  0,  Ve  £  £,  we  have 


D(K,  0) 


k  +  k~1 


(33) 


In  the  above  corollary,  we  verify  that  (32)  reduces  to  (33), 
on  substituting  M  =  0.  In  (32),  the  effect  of  correlation  can 
be  easily  analyzed  through  the  sign  of  the  function  f(M ,  K). 
Also, 


for  K  >  ,  E-  (34a) 
for  K  <  2.  (34b) 

Therefore,  at  large  variance-ratios,  the  presence  of  correlation 
hurts  the  asymptotic  performance,  when  compared  with  the 
independent  case.  But  the  situation  is  reversed  at  low  values  of 
the  variance  ratio  and  the  presence  of  correlation  helps  in  de¬ 
tection  performance.  In  the  next  section,  we  will  draw  similar 
conclusions  when  the  correlation  function  is  the  exponential 
function  through  numerical  evaluations. 


l  >  0 
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(a)  Different  values  of  correlation  coefficient  a,  nugget  M  =  0.5.  (b)  Different  values  of  nugget  M,  correlation  coefficient  a  =  0.5. 

Fig.  6.  Error  exponent  D  vs.  ratio  of  variances  K,  node  density  A  =  1.  See  (32-35). 


D.  Numerical  Results 

In  this  section,  we  focus  on  a  specific  correlation  function 
namely  the  exponential-correlation  function, 

g{R)  =  Me~aR,  a  >  0, 0  <  M  <  1.  (35) 

Using  Theorem  4,  we  numerically  evaluate  D  through  Monte- 
Carlo  runs.  In  (30),  the  error  exponent  is  an  implicit  function  of 
the  correlation  coefficient  a ,  through  the  correlation  function 
g.  For  fixed  values  of  K  and  M,  we  have 

D(K,  M,  A,  a)  =  D{K,  M,  1,  ^=),  (36) 

which  we  obtain  by  changing  the  integration  variable  in  the 
expectation  term  in  (30).  Therefore,  in  terms  of  the  error 
exponent,  increasing  the  node  density  A  is  equivalent  to  a 
lower  correlation  coefficient  at  unit  density.  Here,  we  plot  only 
the  effects  of  correlation  coefficient  a  and  nugget  M  on  D. 

In  Fig. 6(a),  we  plot  the  error  exponent  at  A  =  1  and 
M  =  0.5,  for  different  values  of  correlation  coefficient  a. 
Note,  the  cases  a  =  0  and  a  — >  oo  correspond  to  (32)  and 
(33).  We  notice  that  a  more  correlated  GMRF  or  the  one  with 
smaller  a,  has  a  higher  exponent  at  low  value  of  K,  whereas 
the  situation  is  reversed  at  high  K.  Equivalently,  increasing 
the  node  density  A  improves  the  exponent  at  low  value  of  K, 
but  not  at  high  K.  Also,  when  the  variance  ratio  K  is  large 
enough,  D  appears  to  increase  linearly  with  K  (in  dB),  and 
the  correlation  coefficient  a  and  nugget  M  appear  to  have 
little  effect,  as  expected  from  Theorem  4.  In  Fig. 6(b),  we  plot 
the  exponent  at  constant  correlation  coefficient  a  =  0.5  for 
different  values  of  the  nugget  M.  Also  note,  M  =  0  reduces 
to  the  independent  case.  We  notice  a  similar  behavior  as  the 
correlation  coefficient.  A  higher  value  of  M  results  in  a  higher 
exponent  at  low  K,  but  not  at  high  K. 

VII.  Conclusion 

In  general,  finding  the  closed  form  detection  error  exponent 
is  not  tractable.  The  graphical  structure  of  the  Markov  random 
field  allows  us  to  exploit  existing  results  in  spatial  probability 
literature.  We  employed  the  law  of  large  numbers  for  graph 


functionals  to  derive  the  detection  error  exponent  for  a  Gauss- 
Markov  random  field  with  nearest-neighbor  dependency  graph. 
We  then  investigated  the  influence  of  model  parameters  such 
as  the  variance  ratio  and  the  correlation  function  on  the  error 
exponent. 

In  this  paper,  we  have  assumed  identical  variance  at  every 
sensor.  However,  a  spatially  varying  SNR  model  can  be 
incorporated  into  our  results.  We  have  focused  on  the  GMRF 
defined  by  the  acyclic  dependency  graph  and  derived  the 
exponent  for  the  nearest-neighbor  graph.  This  is  a  simplifying 
assumption.  Although,  the  law  of  large  numbers  is  valid  for 
a  number  of  proximity  graphs,  which  have  edges  between 
“nearby”  points,  the  actual  evaluation  of  the  log-likelihood 
ratio  and  the  exponent  are  intractable  for  most  of  these  graphs. 
We  have  not  considered  correlation  under  null  hypothesis  for 
which  one  requires  a  LLN  with  correlated  marks.  We  have 
also  not  considered  the  case  when  the  signal  field  is  not 
directly  observable,  resulting  in  a  hidden  GMRF.  The  sparse 
structure  of  the  potential  matrix  is  no  longer  valid  under  such 
a  scenario.  However,  note,  GMRF  with  small  neighborhood 
has  been  demonstrated  to  approximate  the  hidden  GMRF  [49] 
as  well  as  Gaussian  field  with  long  correlation  lengths  [50], 
reasonably  well. 

The  error  exponent  can  be  employed  as  a  performance 
measure  for  network  design.  In  [51],  we  utilize  the  closed 
form  derived  in  this  paper  to  obtain  an  optimal  node  density 
that  maximizes  the  exponent  subject  to  a  routing  energy  con¬ 
straint.  We  have  also  proposed  minimum  energy  data  fusion 
and  routing  schemes  that  exploit  the  correlation  structure 
of  Markov  random  field  in  a  related  publication  [41].  We 
further  investigate  tradeoffs  between  the  routing  energy  and 
the  resulting  error  exponent  in  [52]. 
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Appendix 


A.  Proof  of  Theorem  1 

Using  the  expression  AS  =  I,  we  have  the  following 
identities: 


A(i,i) 


E 


S  (i,i) 


1 

S  (*,*)’ 


(37) 


A(i,j) 


3) 


E 

feeAT(i) 


=  0,  Vj  e  A/"(i),  (38) 


where  (37)  is  obtained  by  the  sum-product  of  zth  row  and 
i'b  column  of  A  and  S.  Similarly,  (38)  is  obtained  by  sum- 
product  of  i*  row  of  A  and  j"'  column  of  S  and  dividing  by 
S In  (38),  by  acyclicity  for  k  £  A f(i)  and  k  j,  we 
have  j  k.  From  (8),  we  have 


Z{j,  k ) 

£(' i,j) 


S  (i,k) 


V  j,  k  £  Af{i),k  j. 


Subtracting  (38)  from  (37),  only  the  terms  with  A(i,j)  survive 
and  hence,  we  obtain  A(i,j).  Substituting  all  the  A(i,j)’ s  in 
(37),  we  obtain  A(i,i).  Hence,  all  the  coefficients  of  potential 
matrix  A  are  given  by  (11). 

Let  |A(”)|  be  the  determinant  of  the  potential  matrix  of  n 
nodes.  Assume  n  >  1,  since  we  have  |A(11|  =  E(l,l)_1. 
The  determinant  of  the  potential  matrix  is  the  product  of 
determinants  of  the  connected  components.  We  therefore  con¬ 
sider  only  one  component  G'{V',£')  C  Q.  Assume  Q'  has 
at  least  one  edge,  otherwise  we  have  for  diagonal  matrix 
|A(ra)|  =  Iliev'  Since  Q'  is  acyclic,  it  has  a  leaf, 

i.e.,  there  is  some  vertex  a  with  degree  1.  Let  b  be  its 
only  neighbor.  We  assume  the  vertices  have  been  ordered 
V  =  {Vi,...,Vn}  so  that  14_1  =  b,Vn  =  a.  Then  A("> 
has  the  following  form 

....  .  0 

....  .  0 

A(n—  l,n—  1)  A[n—  l,n) 

•••  0  A(n,n—  1)  A(n,n ) 

from  (11), 


A<">  = 


0 


where  we  have 


A(n,  n ) 
A(n  —  1,  n) 
A(n  —  1,  n  —  1) 


E(n  —  1  ,n—  1) 

[S(n,  n)S(n  —  1,  n  —  1)  —  E(n,  n  —  l)2]  ’ 
— S(n,  n  —  1) 


[£(n,  n)S(n  —  1,  n  —  1)  —  E(n,  n  —  l)2]  ’ 

1  m  i  ^  E(n,n-1) 

— - - -  —  A(n  —  1,  n)  — - - - 

S(n  —  1,  n  —  1)  E(n  —  1,  n  —  1) 


+  C, 


where  C  represents  contributions  from  nodes  in  V'\Vn  i.e., 
with  node  Vn  removed,  and  having  an  edge  with  Vn-%. 
Multiplying  the  nth  column  by 

A(n,n—  1)  —  S(n,  n—  1) 

A(n,n )  E(n  —  l,n—  1) 

and  subtracting  it  from  (n  —  l),h  column  and  using  the 
determinant  rule,  we  have 


0 


|A^|  = 


0 


where 


0 

1,  n  —  1)  A{n  —  1,  n) 

0  A(n,n) 

(39) 


A'{n—  1,  n  —  1)  :=  A(n  —  1,  n  —  1) 
E(n,  n  —  1) 


E(n  —  1,  n  —  1) 


A(n,  n 


Hence,  we  have 

|A^™^|  =  A(n,n)|M„|,  for  n  >  1, 


!)•  (40) 


where  M„  is  the  minor  of  A(n,  n)  in  (39).  Substituting  in 
(40),  we  have  A'(n  —  l,n  —  1)  =  C,  where  as  noted  before, 
C  is  the  contributions  from  nodes  in  V'\U„  and  having  an 
edge  with  Vn-\.  This  implies  that  A'(n  —  l,n  —  1)  is  the 
coefficient  in  the  potential  matrix  for  the  subgraph  induced  by 
Since  only  Vn-i  has  an  edge  with  Vn,  coefficients  of 
nodes  other  than  Vn  and  14,-1  are  unaffected  by  the  removal 
of  14.  Hence,  M„  is  the  potential  matrix  for  the  subgraph 
induced  by  V/\14> 

M„  =  A("_1\ 


Since  V'\14  is  acyclic,  a  leaf  is  always  present,  rearrange  the 
rows  such  that  A(n-1i  has  a  leaf  in  the  last  two  rows,  i.e., 
it  has  the  same  structure  as  in  (39).  Remove  a  leaf  in  each 
step  of  the  recursion,  until  all  the  edges  are  removed,  then 
find  the  determinant  with  the  diagonal  matrix  consisting  of 
the  remaining  nodes  and  we  obtain  (12).  □ 


B.  Proof  of  Lemma  2 

We  can  regard  K,’s  as  marking,  since  under  'Hi,  they  are  i.i.d. 
independent  of  spatial  point  process.  The  strong  stabilization 
condition  is  satisfied  for  NNG  [4,  R  1023,  Lemma  6.1].  We 
therefore  only  need  to  prove  the  uniform  bounded  moment 
condition.  We  express  the  edge  functional  as  the  sum  of  two 
functionals,  for  i  ~  j,  given  by 


f  1  (  Rij  ) 
^2  (77zj ) 


—  log[l  -g2{Rij)\,  (41) 

92(Ri:)[Y?  +  Yfl  -  2ff(Rij)YiYj  ^ 
1  -92(Rij)  ■  (  j 


Given  a  finite  marked  set  V,  the  sum  functional  is  denoted  by 
H  i.e.. 


Hk(V):=  E  ^^,k  =  1,2.  (43) 

(i,j)GNNG(V) 

i,jev 

Given  Hk,  we  denote  the  add  one  cost  [33,  (3.1)],  which  is 
the  increment  in  H,  caused  by  inserting  a  marked  point  at  the 
origin  into  a  finite  marked  set  V,  by 
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Ak(V):=Hk(VU{0})-Hk(V).  (44) 

Hi  satisfies  the  polynomial-bounded  condition  [33,  (3.3)], 
since  <fi i  in  (41)  is  a  finite  function,  and  the  number  of  edges 
in  NNG  is  at  most  n  —  1,  for  n  points.  However,  the  functional 
H2  does  not  satisfy  the  polynomial-bounded  condition  since 
the  measurements  Y,  in  (42)  are  unbounded.  Instead,  we  define 
truncated  random  variable  Z  as 


Fig.  7.  Illustration  of  the  event  that  the  origin  is  a  biroot  in  the  directed 
NNG.  This  implies  that  there  is  no  other  point  in  the  union  of  the  circles 
shown  above.  See  (53). 


z=(  Y,  if  \Y\  <C\ogn,  (45a) 

1  sgn(F)Clogn,  o.w,  (45b) 

where  sgn  is  the  sign  function  and  C  >  0  is  a  constant. 
Consider  the  functionals  H!2 , <f2  by  replacing  Yra  with  Z„  in 
H2  and  (f> 2  respectively.  Now,  H'2  is  polynomially  bounded. 
Further,  we  have  lim^^co  Z  — V  Y  and  hence,  lim,,^^  (H2  — 
H2)  “4-  0. 

Definition  3  (Uniform  Bounded  Moments  for  Define 
U„,  .4  to  be  m  uniform  random  variables  on  A  €  B  and  f?o,x 
to  be  the  (random)  lengths  of  the  edge  (0,  X)  in  graph  Q 
incident  on  the  origin.  Then,  the  bounded  p-moment  condition 
[33,  (3.7)] 


C.  Proof  of  Theorem  3 

We  have  the  distribution  of  Y„  under  the  null  hypothesis 

1  /  V"  Y2 

p[Yn\V;HQ]  =  —  exP(-^2  * 

(27TC7g)2  V  2cTq 

Therefore,  the  limit  of  the  determinant  is  given  by 

log|£0,vl  , 

lmi  - - -  =  log  Co. 

n — >oo  In 

We  have  Yn=i  \  E[Yj2: 7f0]  =  a-s-  under  Ro- 
Therefore,  the  term  in  (24) 


sup  sup  E[  V]  c/)k(Ro.x.)}p 

(ox)g6 

xeum,A 

<  00,  k  =  1,2,  (46) 

is  true  for  some  p  >  1. 

Without  the  above  condition,  nothing  can  be  said  about  the 
almost  sure  convergence,  although,  by  Fatou’s  lemma,  the  limit 
of  the  LLN  would  be  a  bound  on  D. 

Since  fii  and  (j>2  are  decreasing  functions  edge  length,  with 
maximum  at  zero,  we  have 

E[  Y,  Mr o,x)F  <  CpE[M0)]p,  k  =  1, 2,  Vp  >  0, 

(O.X)GNNG(X) 

xettm,A 

where  C  is  the  kissing  number,  a  constant,  and  Deg(O)  <  C 
for  the  NNG.  Now,  </>i(0)  =  —  log[l  —  M2]  <  00,  since  p( 0)  = 
M  <  1,  and 

Mp 

E^(0)P]  <  {i-M2)pm  ~  Y>]  <  °°> 

since  Yj,  Y3  7V(0,  Uq).  Hence,  the  uniform-bounded 

moment  for  in  (46)  holds. 

Now,  we  show  the  uniform-bounded  moment  for  H  [33, 
(3.2)],  obtained  by  replacing  (f>k  in  (46)  by  A*..  The  positive 
part  of  Afc  is  bounded  by  Deg(0)</>fc(0),  whose  expectation  is 
shown  to  be  finite.  For  the  negative  part  A k(pim,A)~ ^  along 
the  lines  of  [4,  Lemma  6.2],  let  l{nn(i)  =  0}  be  the  event 
that  the  origin  is  the  nearest  neighbor  of  i  €  lTrn.A  ■  Then, 
the  number  of  deleted  edges  on  adding  the  origin  is  given  by 
=  0}  <  C,  we  have  Afe(Wm^)_  <  C<j>k( 0), 
whose  expectation  is  shown  to  be  finite.  Hence,  the  bounded- 
moment  condition  for  H  holds  and  LLN  is  applicable.  □ 


±_\Y? 

<Tq/  n 


-  1. 


By  Lemma  2,  the  conditions  for  LLN  hold  and  therefore  as 

n  — >  00, 


--  J2  loS  [l~92{Re)} 


eGNNG(V) 


— E  ^  log[l-p2(i?.0,x)],  (48) 


X:XGPx 

(O.X)GNNG(X) 


1  E 

(i,i)£NNG(V) 

i<j 


g2(Rij)[Y2  +  Y2]-2g(Rij)YiYj 
[1  -g2(Rij)]a2 


E 


Y  2g2(i?  o-x)  (49) 


X:XGPx 
(O.X)GNNG(X) 


where,  in  (49)  we  first  take  the  expectation  over  Yf  s  and 
use  the  fact  that  E[^°^x]  =  2(^)2  and  E[Y0Vx]  =  0. 
Collecting  all  the  terms  we  have  (25).  O 


D.  Proof  of  Lemma  3 

We  use  an  approach  similar  to  [47].  Let  BZ(X.)  denote  a 
circle  of  radius  z,  centered  at  X.  We  take  expectation  on  both 
sides  of  (27)  for  graphs  over  all  the  Poisson  points  XU0.  Let 
NNG(V),  DNNG  and  MNNG  C  DNNG  be  the  undirected 
nearest-neighbot  graph,  the  directed  nearest-neighbor  graph, 
and  edges  between  the  biroots  or  the  mutual  neighbors  of  the 
directed  nearest-neighbor  graph.  See  Fig. 3. 
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E[  53  /(fl(i?0,x))]=E[  53  /(5(-Ro,x))] 

X:Xe-PA  XiXG-Pa, 

(0,X)gNNG(X)  (0,X)gDNNG(X) 

-^E[  53  f(g(Ro,x))].  (50) 

X:XG-Pa 

(0,X)€MNNG(X) 

The  first  term  on  the  right-hand  side  in  (50)  simplifies  as 

E[  53  f(g(R0^))]=E[f(g(Z1))],  (51) 

X:X  evx, 

(0,X)GDNNG(X) 

where  Z\  is  the  unique  directed  nearest-neighbor  distance  of 
the  origin  with  points  distributed  according  to  V\,  the  Poisson 
point  process  of  intensity  A  on  3J2.  The  random  variable  Z\ 
is  like  a  waiting  time,  and  can  be  visualized  as  the  time  taken 
for  an  inflating  circle  to  first  touch  a  point  from  the  Poisson 
process.  We  therefore  have  Zi>  z  iff.  Bz( 0)  does  not  contain 
any  points  from  the  Poisson  process,  i.e., 

P[Zi  >  z]  =  P0X  ^  0  G  Bz  (0)  n  Tx]  =  e”A’I’z2 .  (52) 

Therefore,  Z\  is  Rayleigh  with  second  moment  (27rA)_1. 

Similarly,  for  the  second  term,  we  need  to  find  the  PDF 
of  the  nearest-neighbor  distance  of  the  origin  when  the  origin 
is  a  biroot  or  a  mutual  nearest  neighbor.  This  event  occurs 
when  the  union  of  the  circles  centered  at  origin  and  its  nearest 
neighbor  contains  no  other  Poisson  point.  See  Fig. 7.  Let  A  be 
the  intersection  of  the  events  that  the  directed  nearest-neighbor 
distance  of  origin  lies  in  the  interval  [z,  z  +  dz]  and  the  event 
that  origin  is  a  biroot 

A  :=  (V\  D  (Bz(0)  U  i?2(nn(0)))\{0,  nn(0)}  =  0) 

C\(Zi  g  [z,  z  +  dz]).  (53) 

Its  probability  is  given  by, 


From  (27,  52,  56),  we  obtain  (28).  □ 
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